Load-Balancing Spatially Located Computations using Rectangular Partitions
Abstract
Distributing spatially located heterogeneous workloads is an important problem in parallel scientific computing. We investigate the problem of partitioning such workloads (represented as a matrix of non-negative integers) into rectangles, such that the load of the most loaded rectangle (processor) is minimized. Since finding the optimal arbitrary rectangle-based partition is an NP-hard problem, we investigate particular classes of solutions: rectilinear, jagged and hierarchical. We present a new class of solutions called m-way jagged partitions, propose new optimal algorithms for m-way jagged partitions and hierarchical partitions, propose new heuristic algorithms, and provide worst-case performance analyses for some existing and new heuristics. Moreover, the algorithms are tested in simulation on a wide set of instances. Results show that two of the algorithms we introduce lead to a much better load balance than the state-of-the-art algorithms. We also show how to design a two-phase algorithm that can reach different time/quality tradeoffs.
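To make the problem statement concrete, the following is a minimal Python sketch of one simple jagged-style heuristic: cut the matrix into p horizontal stripes, then cut each stripe independently into q column intervals. It is an illustration under stated assumptions, not the m-way jagged or optimal algorithms proposed in the paper. All function names (`prefix_sums`, `rect_load`, `split_1d`, `jagged_partition`, `max_load`) and the sample workload are hypothetical. Rectangle loads are read from a 2D prefix-sum table, and each one-dimensional cut uses a binary search on the bottleneck value with a greedy feasibility probe, which relies on the loads being non-negative integers.

```python
def prefix_sums(matrix):
    """Return table P where P[i][j] is the sum of matrix[0:i][0:j]."""
    rows, cols = len(matrix), len(matrix[0])
    P = [[0] * (cols + 1) for _ in range(rows + 1)]
    for i in range(rows):
        for j in range(cols):
            P[i + 1][j + 1] = (matrix[i][j] + P[i][j + 1]
                               + P[i + 1][j] - P[i][j])
    return P


def rect_load(P, r0, r1, c0, c1):
    """Load of the half-open rectangle rows [r0, r1), columns [c0, c1)."""
    return P[r1][c1] - P[r0][c1] - P[r1][c0] + P[r0][c0]


def split_1d(weights, parts):
    """Split weights into `parts` contiguous intervals (some possibly empty)
    minimizing the largest interval sum. Returns parts + 1 boundaries."""
    def probe(limit):
        # Greedily fill intervals up to `limit`; fail if too many are needed.
        starts, total = [0], 0
        for i, w in enumerate(weights):
            if total + w > limit:
                starts.append(i)
                total = 0
            total += w
        return starts if len(starts) <= parts else None

    lo, hi = max(weights, default=0), sum(weights)
    while lo < hi:  # binary search for the smallest feasible bottleneck
        mid = (lo + hi) // 2
        if probe(mid) is not None:
            hi = mid
        else:
            lo = mid + 1
    starts = probe(lo)
    # Pad with the end index so that exactly parts + 1 boundaries are returned.
    return starts + [len(weights)] * (parts + 1 - len(starts))


def jagged_partition(matrix, p, q):
    """Heuristic: p row stripes, each cut independently into q rectangles."""
    P = prefix_sums(matrix)
    rows, cols = len(matrix), len(matrix[0])
    row_sums = [rect_load(P, i, i + 1, 0, cols) for i in range(rows)]
    row_cuts = split_1d(row_sums, p)
    rects = []
    for r0, r1 in zip(row_cuts, row_cuts[1:]):
        col_sums = [rect_load(P, r0, r1, j, j + 1) for j in range(cols)]
        col_cuts = split_1d(col_sums, q)
        rects += [(r0, r1, c0, c1) for c0, c1 in zip(col_cuts, col_cuts[1:])]
    return rects


def max_load(matrix, rects):
    """Load of the most loaded rectangle, the quantity to be minimized."""
    P = prefix_sums(matrix)
    return max(rect_load(P, *r) for r in rects)


# Hypothetical 4x4 workload split among 2 x 2 = 4 processors.
workload = [[1, 2, 3, 4],
            [4, 3, 2, 1],
            [5, 5, 5, 5],
            [0, 0, 0, 10]]
rects = jagged_partition(workload, 2, 2)
print(rects, max_load(workload, rects))  # maximum load is 15 (total load 50)
```

In this sketch the column cuts differ between the two stripes, which is what distinguishes a jagged partition from a rectilinear one, where the same column cuts would apply to every stripe. Each 1D split is optimal for its own sequence, but the two-step combination carries no optimality guarantee for the 2D problem.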
Similar resources
Load Balancing Unstructured Adaptive Grids for CFD Problems
Mesh adaptation is a powerful tool for efficient unstructured-grid computations but causes load imbalance among processors on a parallel machine. A dynamic load balancing method is presented that balances the workload across all processors with a global view. After each parallel tetrahedral mesh adaptation, the method first determines if the new mesh is sufficiently unbalanced to warrant a repartit...
Experiences with Mesh-like computations using Prediction Binary Trees
In this paper we aim at exploiting the temporal coherence among successive phases of a computation, in order to implement a load-balancing technique in mesh-like computations to be mapped on a cluster of processors. A key concept on which the load-balancing scheme is built is the use of a Predictor component that is in charge of providing an estimation of the unbalancing between successive...
Efficient Support for Matrix Computations on Heterogeneous Multi-core and Multi-GPU Architectures
We present a new methodology for utilizing all CPU cores and all GPUs on a heterogeneous multicore and multi-GPU system to support matrix computations efficiently. Our approach is able to achieve four objectives: a high degree of parallelism, minimized synchronization, minimized communication, and load balancing. Our main idea is to treat the heterogeneous system as a distributed-memory machine...
Perspectives on asynchronous computations for fluid flow problems
Instances where asynchronous parallel computations can be used for the solution of fluid flow problems are discussed. The use of asynchronous portions of solvers can effectively counteract the consequences of the inefficiencies associated with high latency in the interprocessor (or intercomputer) communication. This is especially so in the case of irregular regions, irregular partitions, or in ...
Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning
Quantum molecular dynamics (QMD) simulations are highly accurate, but they are computationally expensive due to the calculation of the ground-state electronic density matrix P via an O(N^3) diagonalization. Second-order spectral projection (SP2) is an efficient O(N) alternative to obtain P from a Hamiltonian matrix H. This poster presents a data-parallel version of the SP2 algorithm that uses an u...
Journal: J. Parallel Distrib. Comput.
Volume: 72
Publication date: 2012